Inter-Rater Reliability: Dependency on Trait Prevalence and Marginal Homogeneity
نویسنده
چکیده
Researchers have criticized chance-corrected agreement statistics, particularly the Kappa statistic, as being very sensitive to raters’ classification probabilities (marginal probabilities) and to trait prevalence in the subject population. Consequently, several authors have suggested that marginal probabilities be tested for homogeneity and that any comparison between reliability studies be preceded by an assessment of trait prevalence among subjects. The objective of this paper is threefold: (i) to demonstrate that marginal homogeneity testing does not prevent the unpredictable results often obtained with some of the most popular agreement statistics, (ii) to present a simple and reliable inter-rater agreement statistic, and (iii) to gain further insight into the dependency of agreement statistics upon trait prevalence.
منابع مشابه
A comparison of Cohen’s Kappa and Gwet’s AC1 when calculating inter-rater reliability coefficients: a study conducted with personality disorder samples
BACKGROUND Rater agreement is important in clinical research, and Cohen's Kappa is a widely used method for assessing inter-rater reliability; however, there are well documented statistical problems associated with the measure. In order to assess its utility, we evaluated it against Gwet's AC1 and compared the results. METHODS This study was carried out across 67 patients (56% males) aged 18 ...
متن کاملEvaluation of Spasticity Using the Ashworth Scale with Intermediate Scores (ASIS)
Objectives: The main purpose of this research was to study and contribute to an accurate test of spastic limb. The intra, inter rater reliability of the test was examined. Methods: The present study was carried out in two parts In the first part of the study, the modified Ashworth Scale with Intermediate Scores (ASIS) was studied. During the second part of the study the intra, inter rater re...
متن کاملReliability of light microscopy and a computer-assisted replica measurement technique for evaluating the fit of dental copings
The aim of this in vitro study was to assess the reliability of two measurement systems for evaluating the marginal and internal fit of dental copings. Sixteen CAD/CAM titanium copings were produced for a prepared maxillary canine. To modify the CAD surface model using different parameters (data density; enlargement in different directions), varying fit was created. Five light-body silicone rep...
متن کاملTest-Retest and Inter-Rater Reliability Study of the Schedule for Oral-Motor Assessment in Persian Children
Objectives: Reliable and valid clinical tools to screen, diagnose, and describe eating functions and dysphagia in children are highly warranted. Today most specialists are aware of the role of assessment scales in the treatment of affected individuals. However, the problem is that the clinical tools used might be nonstandard, and worldwide, there is no integrated assessment performed to assess ...
متن کاملInter-rater and intra-rater reliability in the interpretation of MTI Photoscreener photographs of Native American preschool children.
PURPOSE To evaluate inter- and intra-rater reliability for the interpretation of MTI Photoscreener photographs taken in a population of Native American preschool children with a high prevalence of astigmatism. METHODS Photographs of 369 children were rated by 11 nonexpert and 3 expert raters. Photographs for each child were scored as pass, refer, or retake. Nonexpert raters scored photos on t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002